Word | Frequency | Number of left neighbors | Ratio |
---|---|---|---|
በእንግሊዝኛ | 357 | 1 | 357.0000 |
ምሳሌ | 1391 | 4 | 347.7500 |
ካሌንዳር | 338 | 1 | 338.0000 |
2091 | 325 | 1 | 325.0000 |
የቀን | 323 | 1 | 323.0000 |
በአሁኑ | 217 | 1 | 217.0000 |
በዚሁ | 215 | 1 | 215.0000 |
ማስታወሻዎች | 212 | 1 | 212.0000 |
በሙሉ | 200 | 1 | 200.0000 |
ስለሆነም | 180 | 1 | 180.0000 |
ከዚያም | 180 | 1 | 180.0000 |
እንዲሁም | 513 | 3 | 171.0000 |
ዛሬ | 167 | 1 | 167.0000 |
የዓመቱ | 160 | 1 | 160.0000 |
ምን | 156 | 1 | 156.0000 |
ሆኖም | 308 | 2 | 154.0000 |
p. | 154 | 1 | 154.0000 |
የህዝብ | 153 | 1 | 153.0000 |
ማርቆስና | 146 | 1 | 146.0000 |
ልደት | 145 | 1 | 145.0000 |
We look at words having extremely few left neighbors. Because we have an artificial additional word before any sentence beginning, this is the only left neighbor for words appearing only (or mainly) at the beginning of sentences. In addition to the artificial sentence beginning word, there is a corresponding artificial sentence ending word. This is denoted by %$% and has as left neighbors mainly punctuation marks.
Table data:
select word, w.freq, count(c.w1_id), w.freq/count(c.w1_id) as r from words w, co_n c where w.w_id=w2_id and w1_id>100 group by w2_id order by r desc limit 20;
Diagram data:
select w.freq, count(c.w1_id) from words w, co_n c where w.w_id=w2_id and w1_id>100 group by w2_id;
In the above diagram seems to be a larger variation than in the diagram in the previous subsection. What is the reason?
There is no symmetry below and above average. Why?
5.1.7.1 Number of NN co-occurrences vs. Frequency I
5.1.7.3 Number of left vs. right NN co-occurrences